互补的时尚推荐旨在识别来自不同类别(例如衬衫,鞋类等)的项目,这些项目“很好地融合在一起”是一件服装。大多数现有方法使用包含手动策划的兼容项目组合的标记的Outfit数据集学习此任务的表示形式。在这项工作中,我们建议通过利用人们经常穿兼容服装的事实来学习从野外街头时尚图像进行兼容性预测的表示形式。我们制定的借口任务是使同一个人所穿的不同物品的表示形式与其他人所穿的物品相比更接近。此外,为了减少推理期间野外和目录图像之间的域间隙,我们引入了对抗性损失,以最大程度地减少两个域之间特征分布的差异。我们对两个流行的时尚兼容性基准进行了实验 - 多视频和多视频搭配服装,并优于现有的自我监督方法,在跨数据库环境中尤其重要,在跨数据库设置中,训练和测试图像来自不同来源。
translated by 谷歌翻译
The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.
translated by 谷歌翻译
Reinforcement learning (RL) has shown great promise with algorithms learning in environments with large state and action spaces purely from scalar reward signals. A crucial challenge for current deep RL algorithms is that they require a tremendous amount of environment interactions for learning. This can be infeasible in situations where such interactions are expensive; such as in robotics. Offline RL algorithms try to address this issue by bootstrapping the learning process from existing logged data without needing to interact with the environment from the very beginning. While online RL algorithms are typically evaluated as a function of the number of environment interactions, there exists no single established protocol for evaluating offline RL methods.In this paper, we propose a sequential approach to evaluate offline RL algorithms as a function of the training set size and thus by their data efficiency. Sequential evaluation provides valuable insights into the data efficiency of the learning process and the robustness of algorithms to distribution changes in the dataset while also harmonizing the visualization of the offline and online learning phases. Our approach is generally applicable and easy to implement. We compare several existing offline RL algorithms using this approach and present insights from a variety of tasks and offline datasets.
translated by 谷歌翻译
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.
translated by 谷歌翻译
Machine learning is the study of computer algorithms that can automatically improve based on data and experience. Machine learning algorithms build a model from sample data, called training data, to make predictions or judgments without being explicitly programmed to do so. A variety of wellknown machine learning algorithms have been developed for use in the field of computer science to analyze data. This paper introduced a new machine learning algorithm called impact learning. Impact learning is a supervised learning algorithm that can be consolidated in both classification and regression problems. It can furthermore manifest its superiority in analyzing competitive data. This algorithm is remarkable for learning from the competitive situation and the competition comes from the effects of autonomous features. It is prepared by the impacts of the highlights from the intrinsic rate of natural increase (RNI). We, moreover, manifest the prevalence of the impact learning over the conventional machine learning algorithm.
translated by 谷歌翻译
机器人车使用成本图来规划无碰撞路径。与地图中的每个单元相关的成本表示感知的环境信息,这些信息通常是在经过几次反复试验后手动确定的。在越野环境中,由于存在几种类型的功能,将与每个功能相关的成本值进行手工制作是挑战。此外,不同手工制作的成本值可以导致相同环境的不同路径,而不可取的环境。在本文中,我们解决了从感知的稳健车辆路径计划中学习成本图值的问题。我们使用深度学习方法提出了一个名为“骆驼”的新颖框架,该方法通过演示来学习参数,从而为路径规划提供适应性和强大的成本图。骆驼已接受过多模式数据集的培训,例如Rellis-3D。骆驼的评估是在越野场景模拟器(MAV)和IISER-B校园的现场数据上进行的。我们还在地面流动站上执行了骆驼的现实实施。结果表明,在非结构化的地形上没有碰撞的情况下,车辆的灵活而强大的运动。
translated by 谷歌翻译
大多数强化学习算法都利用了经验重播缓冲液,以反复对代理商过去观察到的样本进行训练。这样可以防止灾难性的遗忘,但是仅仅对每个样本都分配了同等的重要性是一种天真的策略。在本文中,我们提出了一种根据样本可以从样本中学到多少样本确定样本优先级的方法。我们将样本的学习能力定义为随着时间的推移,与该样品相关的训练损失的稳定减少。我们开发了一种算法,以优先考虑具有较高学习能力的样本,同时将优先级较低,为那些难以学习的样本,通常是由噪声或随机性引起的。我们从经验上表明,我们的方法比随机抽样更强大,而且比仅在训练损失方面优先排序更好,即时间差损失,这是在香草优先的经验重播中使用的。
translated by 谷歌翻译
随着迷你无人机在平民工作领域变得越来越有用,他们需要在混乱的环境中安全操作的方法的需求正在增长,尤其是对于固定翼无人机而言,由于它们无法遵循停止执行的执行方法。本文提出了初步研究,以设计一种基于人工电位场算法中使用的排斥力的改善的定义,以设计一种反应性碰撞算法,以允许在杂乱无章的动态环境中可行且安全的固定翼无人机导航。我们在多种情况下提出了改进定义的模拟结果,我们还讨论了未来的研究以改善这些结果。
translated by 谷歌翻译
在基本层面上,大多数物理方程式都是时间可逆的。在本文中,我们提出了一个在离散计算级别保存该属性的集成商。我们的模拟可以向后和向后运行,并准确地追踪相同的路径。我们通过使用固定和浮点算术的混合来实现理论上可逆的集成器来实现这一目标。我们的主要应用程序是在优化中使用的伴随方法中有效地实现反向步骤。我们的集成商在差分模拟和机器学习(反向传播)中具有应用。
translated by 谷歌翻译
在线学习环境(即联合学习(FL))中,合作学习范式的迅速增加。与大多数FL设置不同,在许多情况下,代理商具有竞争力。每个代理商都想向他人学习,但是它为他人分享的信息的一部分可能很敏感。因此,它希望其隐私。这项工作调查了一组代理人同时工作,以解决类似的组合匪徒问题,同时保持质量约束。这些代理商可以通过使用差异隐私来保密其敏感信息的同时学习敏感信息吗?我们观察到交流可以减少遗憾。但是,保护敏感信息的差异隐私技术使数据嘈杂,并且可能会恶化,而不是帮助改善遗憾。因此,我们注意到,必须决定何时交流以及学习哪些共享数据以学会在遗憾和隐私之间取得功能平衡。对于这样的联合组合MAB设置,我们提出了一个保护隐私的联合联合组合匪徒,P-FCB。我们通过模拟说明了P-FCB的功效。我们进一步表明,我们的算法在遗憾方面提供了改善,同时维护质量阈值和有意义的隐私保证。
translated by 谷歌翻译